Comparison of classic and hybrid HMM approaches to speech recognition over telephone lines
ثبت نشده
چکیده
The subject of the present dissertation is the automatic speaker-inde¬ pendent recognition of isolated German digits spoken (by Swiss people) over the public switched telephone network. The approaches considered for this task are all based on hidden Markov models (HMMs). In ad¬ dition to the classic HMM approaches, several connectionist ideas are investigated in order to improve the discrimination capability of the classic HMM systems. In the first part, the theoretical foundations of HMMs in general and discrete density HMMs (DDHMMs) in particular are summarized. After that, several connectionist ideas, also termed hybrid HMM sys¬ tems in the sequel, are theoretically discussed. In particular, a socalled connectionist-SCHMM approach based on classic semi-continuous HMMs (SCHMMs), which has been proposed by the author, is intro¬ duced. The second part goes into the details of the design of the experimen¬ tal system RECO, which is based on DDHMMs. It covers the speech data collection for the training and test data as well as the different ex¬ periments that have led to the high performance of the recognizer. With 98.6 % word recognition rate, the resultant DDHMM recognizer yields the same performance as the COST 232 reference recognizer based on continuous density HMMs on the same speaker-independent test set. Different connectionist ideas of the first part are compared with each other and with classic HMM approaches in the last part of the thesis. The comparison shows, on the one hand, that the connectionistSCHMM system proposed exhibits the best performance of all hybrid
منابع مشابه
Off-line Arabic Handwritten Recognition Using a Novel Hybrid HMM-DNN Model
In order to facilitate the entry of data into the computer and its digitalization, automatic recognition of printed texts and manuscripts is one of the considerable aid to many applications. Research on automatic document recognition started decades ago with the recognition of isolated digits and letters, and today, due to advancements in machine learning methods, efforts are being made to iden...
متن کاملA Comparison of Hmm and Neural Network Approaches to Real World Telephone Speech Applications
WORLD TELEPHONE SPEECH APPLICATIONS Pieter Vermeulen, Etienne Barnard, Yonghong Yan, Mark Fanty and Ronald Coley Center for Spoken Language Understanding, Oregon Graduate Institute of Science and Technology 20000 N.W. Walker Road, P.O. Box 91000, Portland, OR 97291-1000, USA Tel: +1 503-6901484, E-mail: [email protected] ABSTRACT We compare a standard HMM based and a neural network based appro...
متن کاملThe cascade HMM/ANN hybrid: A new framework for discriminative training in speech recognition
In this paper, a new formulation for discriminative training of HMMs is presented. This formulation uses a properly trained MLP in a simple interconnection with HMMs called “Cascade HMM/ANN Hybrid”. Our training algorithm has simple realization in comparison with other discriminative training for HMMs such as MDI and MMI. We also present a rigid mathematical proof of its convergence. We found t...
متن کاملشبکه عصبی پیچشی با پنجرههای قابل تطبیق برای بازشناسی گفتار
Although, speech recognition systems are widely used and their accuracies are continuously increased, there is a considerable performance gap between their accuracies and human recognition ability. This is partially due to high speaker variations in speech signal. Deep neural networks are among the best tools for acoustic modeling. Recently, using hybrid deep neural network and hidden Markov mo...
متن کاملConnectionist Probability Estimators in Hmm Using Genetic Clustering Application for Speech Recognition and Medical Diagnosis
The main goal of this paper is to compare the performance which can be achieved by five different approaches analyzing their applications’ potentiality on real world paradigms. We compare the performance obtained with (1) Multi-network RBF/LVQ structure (2) Discrete Hidden Markov Models (HMM) (3) Hybrid HMM/MLP system using a Multi LayerPerceptron (MLP) to estimate the HMM emission probabilitie...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2008